Hybrid Implementation and Performance Analysis for High Performance Computation Workload
نویسنده
چکیده
Given the need to achieve maximum performance possible, offloading intensive computation workload to GPU is a key to achieve this goal. Offloading most of the workload to GPU may not results in desired performance, so a middle approach is more suitable such as splitting the workload between the CPU and the GPU can be considered as an optimized approach. In this study, we used a popular high performance computation workload which can also be implemented using a hybrid approach in which part of the workload is offloaded to the CPU. We also present a performance estimation method which is verified to estimate performance with in 5% error margin.
منابع مشابه
Parallelization of the Treecode Algorithm for N-Body Simulation Using MPI, Hybrid, and GridRPC Programming Paradigms
This dissertation describes the parallelization of the treecode algorithm for N-Body problem and performance comparison among three different parallel programming paradigms, MPI, hybrid MPI-OpenMP, and GridRPC. In N-Body simulation, the specific routine for calculating the forces on the bodies which accounts for upwards of 90% of the cycles in typical computations is eminently suitable for obta...
متن کاملComparing performance of organization on implementation of customer relationship management systems using ANP and TOPSIS hybrid approach
As the customers are the main reason of the formation and survival of the organization, not only understanding their obvious needs, but also forecasting, determining and guiding their hidden needs, design and implementing plans of offering services for meeting these needs for attracting customers are among cornerstone of any activity in the organization. In this research, one compares the perfo...
متن کاملA Novel Hybrid-Excited Modular Variable Reluctance Motor for Electric Vehicle Applications: Analysis, Comparison, and Implementation
A variable reluctance machine (VRM) has been proven to be an outstanding candidate for electric vehicle (EV) applications. This paper introduces a new double-stator, 12/14/12-pole three-phase hybrid-excited modular variable reluctance machine (MVRM) for EV applications. In order to demonstrate the superiorities of the proposed structure, the static torque characteristics and dynamic performance...
متن کاملHPC Selection of Models of DNA Substitution for Multicore Clusters
This paper presents the High Performance Computing (HPC) support of jModelTest2, the most popular bioinformatic tool for the statistical selection of models of DNA substitution. As this task can demand vast computational resources, especially in terms of processing power, jModelTest2 implements three parallel algorithms for model selection: (1) a multithreaded implementation for shared memory a...
متن کاملارزیابی بارکاری ذهنی کنترلر های ترافیک هوایی بر اساس فاکتورهای باروظیفه در شبیه ساز کنترل ترافیک هوایی
Background and aim: Air traffic control has known as a complex cognitive task, which requires controller to focus on task for long time. Mental workload plays an important role in the performance of controllers. The aim of this study was to assess the workload of air traffic controller on the basis of task load factors. Methods: The present descriptive-analytical study was conducted among fo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JCS
دوره 10 شماره
صفحات -
تاریخ انتشار 2014